IDIAP Martigny - Valais - Suisse Continuous Audio � Visual Speech Recognition

نویسنده

Juergen Luettin

چکیده

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We t a c kle the problem of joint temporal modelling of the acoustic and visual speech signals by applying Multi-Stream hidden Markov models. This approach allows the use of diierent temporal topologies and levels of stream integration and hence enables to model temporal dependencies more accurately. T h e system has been evaluated for a continuously spoken digit recognition task of 37 subjects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IDIAP Martigny - Valais - Suisse Fast Object Detection using MLP and FFT

We propose a new technique that speeds up signi cantly the time needed by a trained network MLP in our case to detect a face in a large image We reformulate neural activities in the hidden layer of the MLP in terms of lter convolution enabling the use of Fourier transform for an e cient computation of the neural activities A formal proof and a complexity analysis are presented Finally some exam...

متن کامل

Martigny - Valais - Suisse

متن کامل

IDIAP Martigny - Valais - Suisse Multi � Modal Data Fusion for Person Authentication

In the context of multi modal person authentication a set of experts face recognizer speaker recognizer etc give their opinion about the identity of an individual The opinions of the experts can be combined to form a nal decision rejecting or accepting the claim We show that the nal decision is a binary classi cation problem and propose to solve it by a Support Vector Machine SVM We compare our...

متن کامل

IDIAP Martigny - Valais - Suisse Optimal Parameterization of Point Distribution Models Georg Thimm Juergen

We address the problem of determining the optimal model complexity for shape modeling This complexity is a compromise between model speci city and generality We show that the error of a model can be split into two components the model error and the tting error of which the rst one can be used to optimize the model complexity based on the speci c application This strategy improves over tradition...

متن کامل

Martigny - Valais - Suisse Illumination � robust Pattern Matching using Distorted Histograms Georg

It is argued that global illumination should be modeled separately from other incidents that change the appearance of objects The e ects of intensity variations of the global illumination are discussed and constraints deduced that restrict the shape of a function that maps the histogram of a template to the histogram of an image location This approach is illustrated for simple pattern matching ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

IDIAP Martigny - Valais - Suisse Continuous Audio � Visual Speech Recognition

نویسنده

چکیده

منابع مشابه

IDIAP Martigny - Valais - Suisse Fast Object Detection using MLP and FFT

Martigny - Valais - Suisse

IDIAP Martigny - Valais - Suisse Multi � Modal Data Fusion for Person Authentication

IDIAP Martigny - Valais - Suisse Optimal Parameterization of Point Distribution Models Georg Thimm Juergen

Martigny - Valais - Suisse Illumination � robust Pattern Matching using Distorted Histograms Georg

عنوان ژورنال:

اشتراک گذاری